Search CORE

39 research outputs found

Gene network reconstruction from microarray data

Author: AV Werhli
B Efron
Florence Jaffrezic
Gwenola Tosser-Klopp
J Hausser
J Schäfer
J Whittaker
R Opgen-Rhein
W Swinkels
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Often, software available for biological pathways reconstruction rely on literature search to find links between genes. The aim of this study is to reconstruct gene networks from microarray data, using Graphical Gaussian models. Results The <it>GeneNet </it>R package was applied to the Eadgene chicken infection data set. No significant edges were found for the list of differentially expressed genes between conditions MM8 and MA8. On the other hand, a large number of significant edges were found among 85 differentially expressed genes between conditions MM8 and MM24. Conclusion Many edges were inferred from the microarray data. Most of them could, however, not be validated using other pathway reconstruction software. This was partly due to the fact that a quite large proportion of the differentially expressed genes were not annotated. Further biological validation is therefore needed for these networks, using for example in vitro invalidation of genes.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ProdInra

Inferring signalling networks from longitudinal data using sampling based approaches in the R-package 'ddepn'

Author: AV Werhli
C Bender
Christian Bender
CP Paweletz
DS Lee
F Markowetz
F Markowetz
Frauke Henjes
H Fröhlich
H Fröhlich
I Shmulevich
K Murphy
K Sachs
M Kanehisa
MK Cowles
P Sheridan
S Nelander
Silvia vd Heyde
Stefan Wiemann
T Akutsu
Tim Beißbarth
Ulrike Korf
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Network inference from high-throughput data has become an important means of current analysis of biological systems. For instance, in cancer research, the functional relationships of cancer related proteins, summarised into signalling networks are of central interest for the identification of pathways that influence tumour development. Cancer cell lines can be used as model systems to study the cellular response to drug treatments in a time-resolved way. Based on these kind of data, modelling approaches for the signalling relationships are needed, that allow to generate hypotheses on potential interference points in the networks. Results We present the R-package 'ddepn' that implements our recent approach on network reconstruction from longitudinal data generated after external perturbation of network components. We extend our approach by two novel methods: a Markov Chain Monte Carlo method for sampling network structures with two edge types (activation and inhibition) and an extension of a prior model that penalises deviances from a given reference network while incorporating these two types of edges. Further, as alternative prior we include a model that learns signalling networks with the scale-free property. Conclusions The package 'ddepn' is freely available on R-Forge and CRAN <url>http://ddepn.r-forge.r-project.org</url>, <url>http://cran.r-project.org</url>. It allows to conveniently perform network inference from longitudinal high-throughput data using two different sampling based network structure search algorithms.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Seeded Bayesian Networks: Constructing genetic networks from microarray data

Author: AI Saeed
AI Saeed
AJ Hartemink
Amira Djebbari
AV Werhli
D Heckerman
D Husmeier
DC Weaver
DH Wolpert
DM Chickering
DM Chickering
E Frank
G Bastos
J McEntyre
JF Rual
John Quackenbush
JR Nevins
JW Harbour
M Kanehisa
ME Ross
ME Ross
N Friedman
N Friedman
O Gevaert
P Le Phillip
P Shannon
PT Spellman
R Castelo
S Acid
S Aref
S Imoto
T Akutsu
T Chen
T Fawcett
TH Cormen
TK Jenssen
TR Golub
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background DNA microarrays and other genomics-inspired technologies provide large datasets that often include hidden patterns of correlation between genes reflecting the complex processes that underlie cellular metabolism and physiology. The challenge in analyzing large-scale expression data has been to extract biologically meaningful inferences regarding these processes – often represented as networks – in an environment where the datasets are often imperfect and biological noise can obscure the actual signal. Although many techniques have been developed in an attempt to address these issues, to date their ability to extract meaningful and predictive network relationships has been limited. Here we describe a method that draws on prior information about gene-gene interactions to infer biologically relevant pathways from microarray data. Our approach consists of using preliminary networks derived from the literature and/or protein-protein interaction data as seeds for a Bayesian network analysis of microarray results. Results Through a bootstrap analysis of gene expression data derived from a number of leukemia studies, we demonstrate that seeded Bayesian Networks have the ability to identify high-confidence gene-gene interactions which can then be validated by comparison to other sources of pathway data. Conclusion The use of network seeds greatly improves the ability of Bayesian Network analysis to learn gene interaction networks from gene expression data. We demonstrate that the use of seeds derived from the biomedical literature or high-throughput protein-protein interaction data, or the combination, provides improvement over a standard Bayesian Network analysis, allowing networks involving dynamic processes to be deduced from the static snapshots of biological systems that represent the most common source of microarray data. Software implementing these methods has been included in the widely used TM4 microarray analysis package.</p

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Constructing non-stationary Dynamic Bayesian Networks with a flexible lag choosing mechanism

Author: A Bernard
A Hall
A Nobile
A Para
AJ Hartemink
AV Werhli
CA Benedict
D Heckerman
D Husmeier
F Guo
H Duan
H Yu
HH McAdams
J Yu
Jr JD
Jun Huan
JW Robinson
K Honda
K Murphy
M Grzegorczy
M Zou
MF Covington
MN Arbeitman
N Friedman
N Nariai
P Mas
PA Salome
PJ Green
RM Cripps
S Chib
S Imoto
S Imoto
S Raza
SY Kim
T Mizuno
T Sandmann
W Zhao
W Zhao
Yi Jia
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Dynamic Bayesian Networks (DBNs) are widely used in regulatory network structure inference with gene expression data. Current methods assumed that the underlying stochastic processes that generate the gene expression data are stationary. The assumption is not realistic in certain applications where the intrinsic regulatory networks are subject to changes for adapting to internal or external stimuli. Results In this paper we investigate a novel non-stationary DBNs method with a potential regulator detection technique and a flexible lag choosing mechanism. We apply the approach for the gene regulatory network inference on three non-stationary time series data. For the Macrophages and Arabidopsis data sets with the reference networks, our method shows better network structure prediction accuracy. For the Drosophila data set, our approach converges faster and shows a better prediction accuracy on transition times. In addition, our reconstructed regulatory networks on the Drosophila data not only share a lot of similarities with the predictions of the work of other researchers but also provide many new structural information for further investigation. Conclusions Compared with recent proposed non-stationary DBNs methods, our approach has better structure prediction accuracy By detecting potential regulators, our method reduces the size of the search space, hence may speed up the convergence of MCMC sampling.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

KU ScholarWorks

PubMed Central

Using Stochastic Causal Trees to Augment Bayesian Networks for Modeling eQTL Datasets

Author: AFM Smith
Ambuj K Singh
AS Dimas
AV Werhli
BE Stranger
BJ Chen
D Heckerman
D Husmeier
D Husmeier
D Madigan
DC Kulp
DJ Lockhart
DM Ruderfer
E Chaibub Neto
EE Schadt
EO Perlstein
GA Churchill
J Pearl
J Zhu
J Zhu
J Zhu
JD Storey
JJ Faith
JJ Keurentjes
Kyle C Chipman
M Ashburner
M Morley
M Schena
MH Kutner
N Bing
N Friedman
N Friedman
O Litvin
RB Brem
RB Brem
RC Jansen
RW Doerge
S Imoto
S Mukherjee
SI Lee
W Pan
W Zhang
W Zou
Y Benjamini
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The combination of genotypic and genome-wide expression data arising from segregating populations offers an unprecedented opportunity to model and dissect complex phenotypes. The immense potential offered by these data derives from the fact that genotypic variation is the sole source of perturbation and can therefore be used to reconcile changes in gene expression programs with the parental genotypes. To date, several methodologies have been developed for modeling eQTL data. These methods generally leverage genotypic data to resolve causal relationships among gene pairs implicated as associates in the expression data. In particular, leading studies have augmented Bayesian networks with genotypic data, providing a powerful framework for learning and modeling causal relationships. While these initial efforts have provided promising results, one major drawback associated with these methods is that they are generally limited to resolving causal orderings for transcripts most proximal to the genomic loci. In this manuscript, we present a probabilistic method capable of learning the causal relationships between transcripts at all levels in the network. We use the information provided by our method as a prior for Bayesian network structure learning, resulting in enhanced performance for gene network reconstruction. Results Using established protocols to synthesize eQTL networks and corresponding data, we show that our method achieves improved performance over existing leading methods. For the goal of gene network reconstruction, our method achieves improvements in recall ranging from 20% to 90% across a broad range of precision levels and for datasets of varying sample sizes. Additionally, we show that the learned networks can be utilized for expression quantitative trait loci mapping, resulting in upwards of 10-fold increases in recall over traditional univariate mapping. Conclusions Using the information from our method as a prior for Bayesian network structure learning yields large improvements in accuracy for the tasks of gene network reconstruction and expression quantitative trait loci mapping. In particular, our method is effective for establishing causal relationships between transcripts located both proximally and distally from genomic loci.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

A negative selection heuristic to predict new transcriptional targets

Author: A Karatzoglou
A Polynikis
AA Margolin
AH Brivanlou
AV Werhli
B Liu
B Liu
B Zhang
C Elkan
C Wang
CW Hsu
F Mordelet
H Salgado
H Yu
HC Kim
HT Lin
IH Witten
JJ Faith
JJ Faith
JP Vert
JR Bock
K Basso
L Cerulo
L Cerulo
Luigi Cerulo
M Bansal
M Bansal
M Ceccarelli
M Grzegorczyk
Michele Ceccarelli
P Stegmaier
P Zoppoli
Pietro Zoppoli
RA Irizarry
S Liang
TS Gardner
U Alon
V Matys
Vincenzo Paduano
W Ci
X Li
X Li
X Wang
XL Li
Y Yamanishi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Incorporating Existing Network Information into Gene Network Inference

Author: A Meissner
A Sharov
AA Margolin
AV Werhli
B Efron
BE Bernstein
C Jiang
Cathal Seoighe
D Gilbert
E Segal
ER Mardis
F Mordelet
H de Jong
H Li
H Zou
I Park
J Friedman
J Friedman
J Kim
J Yu
JJ Faith
K Knight
K Okita
K Okita
K Takahashi
K Tan
M Bansal
M Bansal
M Gustafsson
M Stadtfeld
M Wernig
ME Donohoe
N Friedman
N Ivanova
N Tsubooka
O Banerjee
P Tseng
Q Zhou
Qing Nie
R Bonneau
R Bonneau
R Tibshirani
RW Kennard
S Mukherjee
Scott Christley
TS Gardner
TS Mikkelsen
X Chen
X Zhang
Xiaohui Xie
XY Li
Y Chen
Y Pilpel
Y Tamada
Y Wang
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

One methodology that has met success to infer gene networks from gene expression data is based upon ordinary differential equations (ODE). However new types of data continue to be produced, so it is worthwhile to investigate how to integrate these new data types into the inference procedure. One such data is physical interactions between transcription factors and the genes they regulate as measured by ChIP-chip or ChIP-seq experiments. These interactions can be incorporated into the gene network inference procedure as a priori network information. In this article, we extend the ODE methodology into a general optimization framework that incorporates existing network information in combination with regularization parameters that encourage network sparsity. We provide theoretical results proving convergence of the estimator for our method and show the corresponding probabilistic interpretation also converges. We demonstrate our method on simulated network data and show that existing network information improves performance, overcomes the lack of observations, and performs well even when some of the existing network information is incorrect. We further apply our method to the core regulatory network of embryonic stem cells utilizing predicted interactions from two studies as existing network information. We show that including the prior network information constructs a more closely representative regulatory network versus when no information is provided

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Quantitative utilization of prior biological knowledge in the Bayesian network modeling of gene expression data

Author: A Djebbari
A Lechner
AG Fraser
AGME Fraser
AH Tong
AJ Hartemink
AV Werhli
C Alfarano
CJ Burns
CJ Needham
D Heckerman
D Heckerman
D Husmeier
D Madigan
E Steele
G Bastos
GAaXW Xue-wen Chen
GF Cooper
GO Consortium
HW Mewes
I Lee
I Lee
I Simon
IDS Lee
JCY Zhu
JJ Han
JM Servitja
JS Ide
K Murphy
L Franke
M Bansal
M Oti
N Friedman
N Friedman
O Gevaert
OG Troyanskaya
P Domingos
P Larsen
P Le Phillip
P Le Phillip
P Shannon
PL Eyad Almasri
PT Spellman
R Jansen
R Jansen
RJ Cho
RJ Cho
S Gao
S Imoto
S Imoto
S Imoto
Shouguo Gao
SHT Imoto
SL Cao
T Van den Bulcke
TK Jenssen
U Wittig
X Wang
Xujing Wang
Y Tamada
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Bayesian Network (BN) is a powerful approach to reconstructing genetic regulatory networks from gene expression data. However, expression data by itself suffers from high noise and lack of power. Incorporating prior biological knowledge can improve the performance. As each type of prior knowledge on its own may be incomplete or limited by quality issues, integrating multiple sources of prior knowledge to utilize their consensus is desirable. Results We introduce a new method to incorporate the quantitative information from multiple sources of prior knowledge. It first uses the Naïve Bayesian classifier to assess the likelihood of functional linkage between gene pairs based on prior knowledge. In this study we included cocitation in PubMed and schematic similarity in Gene Ontology annotation. A candidate network edge reservoir is then created in which the copy number of each edge is proportional to the estimated likelihood of linkage between the two corresponding genes. In network simulation the Markov Chain Monte Carlo sampling algorithm is adopted, and samples from this reservoir at each iteration to generate new candidate networks. We evaluated the new algorithm using both simulated and real gene expression data including that from a yeast cell cycle and a mouse pancreas development/growth study. Incorporating prior knowledge led to a ~2 fold increase in the number of known transcription regulations recovered, without significant change in false positive rate. In contrast, without the prior knowledge BN modeling is not always better than a random selection, demonstrating the necessity in network modeling to supplement the gene expression data with additional information. Conclusion our new development provides a statistical means to utilize the quantitative information in prior biological knowledge in the BN modeling of gene expression data, which significantly improves the performance.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Integrative modeling of transcriptional regulation in response to antirheumatic therapy

Abstract Background The investigation of gene regulatory networks is an important issue in molecular systems biology and significant progress has been made by combining different types of biological data. The purpose of this study was to characterize the transcriptional program induced by etanercept therapy in patients with rheumatoid arthritis (RA). Etanercept is known to reduce disease symptoms and progression in RA, but the underlying molecular mechanisms have not been fully elucidated. Results Using a DNA microarray dataset providing genome-wide expression profiles of 19 RA patients within the first week of therapy we identified significant transcriptional changes in 83 genes. Most of these genes are known to control the human body's immune response. A novel algorithm called TILAR was then applied to construct a linear network model of the genes' regulatory interactions. The inference method derives a model from the data based on the Least Angle Regression while incorporating DNA-binding site information. As a result we obtained a scale-free network that exhibits a self-regulating and highly parallel architecture, and reflects the pleiotropic immunological role of the therapeutic target TNF-alpha. Moreover, we could show that our integrative modeling strategy performs much better than algorithms using gene expression data alone. Conclusion We present TILAR, a method to deduce gene regulatory interactions from gene expression data by integrating information on transcription factor binding sites. The inferred network uncovers gene regulatory effects in response to etanercept and thus provides useful hypotheses about the drug's mechanisms of action.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Evaluation and improvement of the regulatory inference for large co-expression networks with limited sample size

Author: A Fuente de la
A Reverter
AA Margolin
AL Barabasi
AV Werhli
B Zhang
B-E Perrin
C Olsen
Cristiane P. G. Calixto
D Marbach
D Marbach
F Markowetz
F Steinke
G Altay
H Hache
H Jong de
H Lahdesmaki
H Ma
H Peng
J Linde
J Schäfer
JD Allen
JJ Faith
John W. S. Brown
KP Murphy
KY Yip
L Song
LJ Kogelman
ME Studham
MV DiLeo
N Friedman
N Friedman
N Omranian
Nikoleta Tzioutziou
NS Watson-Haigh
P Bellot
P Langfelder
P Sarder
PB Madhamshettiwar
Ping Lin
R Albert
R Dehghannasiri
RJ Flassig
RJ Prill
Robbie Waugh
Runxuan Zhang
S Ballouz
S Bornholdt
S Kim
S Martin
S Rogers
S Roy
SD Walter
SM Ud-Dean
SM Ud-Dean
T Bulcke Van den
T Saito
T Schaffter
TM Cover
V Huynh-Thu
W Zhao
WC Young
Wenbin Guo
Y Tu
Y Zuo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2017
Field of study

Abstract Background Co-expression has been widely used to identify novel regulatory relationships using high throughput measurements, such as microarray and RNA-seq data. Evaluation studies on co-expression network analysis methods mostly focus on networks of small or medium size of up to a few hundred nodes. For large networks, simulated expression data usually consist of hundreds or thousands of profiles with different perturbations or knock-outs, which is uncommon in real experiments due to their cost and the amount of work required. Thus, the performances of co-expression network analysis methods on large co-expression networks consisting of a few thousand nodes, with only a small number of profiles with a single perturbation, which more accurately reflect normal experimental conditions, are generally uncharacterized and unknown. Methods We proposed a novel network inference methods based on Relevance Low order Partial Correlation (RLowPC). RLowPC method uses a two-step approach to select on the high-confidence edges first by reducing the search space by only picking the top ranked genes from an intial partial correlation analysis and, then computes the partial correlations in the confined search space by only removing the linear dependencies from the shared neighbours, largely ignoring the genes showing lower association. Results We selected six co-expression-based methods with good performance in evaluation studies from the literature: Partial correlation, PCIT, ARACNE, MRNET, MRNETB and CLR. The evaluation of these methods was carried out on simulated time-series data with various network sizes ranging from 100 to 3000 nodes. Simulation results show low precision and recall for all of the above methods for large networks with a small number of expression profiles. We improved the inference significantly by refinement of the top weighted edges in the pre-inferred partial correlation networks using RLowPC. We found improved performance by partitioning large networks into smaller co-expressed modules when assessing the method performance within these modules. Conclusions The evaluation results show that current methods suffer from low precision and recall for large co-expression networks where only a small number of profiles are available. The proposed RLowPC method effectively reduces the indirect edges predicted as regulatory relationships and increases the precision of top ranked predictions. Partitioning large networks into smaller highly co-expressed modules also helps to improve the performance of network inference methods. The RLowPC R package for network construction, refinement and evaluation is available at GitHub: https://github.com/wyguo/RLowPC

Crossref

Directory of Open Access Journals

University of Dundee Online Publications